⚡ Performance Mythology

Benchmarking Folklore, Optimization Legends, Speed Misconceptions, Profiling Truth

Understanding why deterministic output from LLMs is nearly impossible
unstract.com·18h·
Discuss: Hacker News
🎯Performance Proofs
Survey of NLU Benchmarks Diagnosing Linguistic Phenomena: Why not Standardize Diagnostics Benchmarks?
arxiv.org·6h
⚙️Compression Benchmarking
Postgres 18 beta2: large server, Insert Benchmark
smalldatum.blogspot.com·1d·
Discuss: smalldatum.blogspot.com
🗄️PostgreSQL WAL
Show HN: LLM-benchmark – Make LLMs fight for fastest ops/SEC on your code
github.com·6h·
Discuss: Hacker News
🔄Reproducible Builds
Giving Benchmarks a Boat
buttondown.com·16h·
Discuss: Lobsters, Hacker News
🎯Performance Forensics
Which Backend Is Better for Speed? We Ran 1 Million Tests to Find Out
hackernoon.com·1d
🎬WebCodecs
GLM-4.5 Teardown: Is This the GPT-4 and Claude Killer We've Been Waiting For?
algogist.com·15h·
Discuss: Hacker News
🎯Emulator Accuracy
Nvidia's N1X processor: As many shader cores as in a GeForce RTX 5070
heise.de·21h
🖥️Terminal Renaissance
Intelligent Data Movement and Data Placement dictate the future of AI Data Infrastructure
storagegaga.com·11h
🏠Homelab Archaeology
tcmalloc's Temeraire: A Hugepage-Aware Allocator
paulcavallaro.com·21h·
Discuss: Hacker News, r/compsci, r/programming
💾Memory Mapping
Machine Learning Fundamentals: k-means example
dev.to·19h·
Discuss: DEV
👁️Observatory Systems
Optimizing enterprise AI assistants: How Crypto.com uses LLM reasoning and feedback for enhanced efficiency
aws.amazon.com·17h
✨Effect Handlers
Show HN: Whisper at 1.58 bits with custom kernels for edge inference
medium.com·21h·
Discuss: Hacker News
📊Quantization
[P] Sub-millisecond GPU Task Queue: Optimized CUDA Kernels for Small-Batch ML Inference on GTX 1650.
reddit.com·2d·
Discuss: r/MachineLearning
📊Performance Profiling
AI reshapes the craft of software engineering, with Yoav Tzfati
complexsystemspodcast.com·11h·
Discuss: Hacker News
🔄Language Evolution
Can small AI models think as well as large ones?
seangoedecke.com·2d
📊Quantization
Reminiscence Attack on Residuals: Exploiting Approximate Machine Unlearning for Privacy
arxiv.org·6h
💻Local LLMs
LoRA-PAR: A Flexible Dual-System LoRA Partitioning Approach to Efficient LLM Fine-Tuning
arxiv.org·6h
💻Local LLMs
Contraction Hierarchies: HMC Clinic Project Recap
blog.appliedcomputing.io·8h·
Discuss: Lobsters, Hacker News
🎯Performance Proofs
Running in CIRCLE? A Simple Benchmark for LLM Code Interpreter Security
arxiv.org·1d
🔒Language-based security